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1. The request filed on December 17, 2002 for a Continued Prosecution Application (CPA) 
under 37 CFR 1.53(d) based on parent Application No. 09/196,574 is acceptable and a CPA has 
been established. An action on the CPA follows. 

2. The applicants 5 arguments from the amendment filed November 13, 2002 have been noted 
and considered, but are deemed moot in view of the following new grounds of rejections. 

3. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all obviousness 
rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 1-16 are rejected under 35 U.S.C. 103(a) as being unpatentable over Stenger of 
record (DE 3608489A1) in view of Katata et al of record (5,815,601) and Woodfill et al 
(6,215,898). 

Stenger discloses a method of improving image segmentation of a video telephone scene 
as shown in Figures 3 and 4, and substantially the same image processing device and system, 
method of encoding a stereo pair of images, computer executable process steps to process image 
data from a stereo pair of images, and apparatus for processing a stereo pair of images as claimed 
in claims 1-16, comprising substantially the same input which receives a stereo pair of images (see 
10 of Figure 3 and 11, 12 of Figure 4); a foreground extractor (13-15 of Figure 4 and see page 4, 
lines 4-10 of translated article) coupled to the input which compares location of like pixel 
information in each image to determine which pixel information is foreground pixel information 
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and which pixel information is background pixel information, wherein the foreground extractor 
computes the difference in location of like pixels in each image and selects the foreground pixels 
as those pixels whose difference in location falls above a threshold distance; wherein the stereo 
pair of images are received from a stereo pair of cameras spaced closely from one another in a 
video conference system (see Figure 3); the extracting includes identifying the location of like 
pixels in each of the stereo pair of images, calculating the difference between the locations of like 
pixels, and determining for each set of like pixels whether the difference between locations falls 
above a threshold difference, and if so identifying those pixels as foreground information (see 
page 4, lines 4-10 of translated article); a memory which stores process steps (i.e., as provided to 
carry out functions within Figure 4), and a processor which executes the process steps stored in 
the memory so as to extract foreground information from the stereo pair of images, and if the 
difference in location is above a set threshold the pixel information is identified as foreground 
pixel information, if below the set threshold the pixel information is determined to be background 
pixel information (see page 4, lines 4-10 of translated article); 

Stenger does not particularly disclose, though, the folio wings: 

(a) a DCT block classifier coupled to the foreground extractor which determines which 
DCT blocks of at least one of the images contain a threshold amount of foreground information; 
and an encoder coupled to the DCT block classifier which encodes the DCT blocks having the 
threshold amount of foreground pixel information with a first level of quantization and which 
encodes the DCT blocks having less than the threshold amount of foreground information as 
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background information at a second lower quantization level, the encoder encodes the foreground 
pixel information at a first high level of quantization and which encodes background pixel 
information at a second lower level of quantization, wherein the encoding step encodes the entire 
8x8 block of DCT coefficients at the first higher quantization level if the 8 x 8 block of DCT 
coefficients contains the predetermined amount of foreground pixel information as claimed in 
claims 1,4, 7, 8,11,12, and 14-16; 

(b) wherein the foreground pixel information is defined in terms of entire 8x8 blocks of 
DCT coefficients, wherein the encoding step encodes an entire 8x8 block of DCT coefficients as 
foreground information if at least a predetermined number of foreground pixels are within the 8 x 
8 block, otherwise the entire 8x8 block of DCT coefficients is encoded as background 
information as claimed in claims 6 and 10; and 

(c) wherein at least a majority of a bandwidth is encoded at the first quantization level and 
the first high/higher level of quantization as claimed in claims 1, 4, 7, 8, 1 1, and 14. 

Regarding (a) and (b), Katata et al discloses an image encoder as shown in Figure 1, and 
teaches the conventional use of a DCT block classifier (i.e., within 106 of Figure 1, and see 
column 5, lines 1-4) coupled to a foreground extractor (i.e., 101, 102 of Figure 1 and see column 
4, line 45 to column 5, line 4) for determining which DCT blocks of at least one of the images 
contain a threshold amount of foreground information; an encoder (i.e, within 106 of Figure 1, 
and see column 5, lines 1-4) coupled to the DCT block classifier which encodes the DCT blocks 
having the threshold amount of foreground information with a first level of quantization and 



Application/Control Number: 09/196,574 
Art Unit: 2613 



Page 5 



which encodes the DCT blocks having less than the threshold amount of foreground information 
as background information (i.e., background information is being provided by the threshold 15 of 
Figure 4 of Stenger et al) at a second lower quantization level (see column 1, lines 12-25, columns 
7-8), the encoder encodes the foreground pixel information at a first high level of quantization and 
which encodes background pixel information at a second lower level of quantization (see column 
1, lines 12-25, columns 7-8), wherein the encoding step encodes the entire 8x8 block of DCT 
coefficients at the first higher quantization level if the 8 x 8 block of DCT coefficients contains the 
predetermined amount of foreground pixel information (see column 1, lines 12-58, columns 7-8); 
wherein the foreground pixel information is defined in terms of entire 8x8 blocks of DCT 
coefficients, wherein the encoding step encodes an entire 8x8 block of DCT coefficients as 
foreground information if at least a predetermined number of foreground pixels are within the 8 x 
8 block, otherwise the entire 8x8 block of DCT coefficients is encoded as background 
information (see column 1, lines 12-58, columns 7-8). Therefore, it would have been obvious to 
one of ordinary skill in the art, having the Stenger and Katata et al references in front of him/her 
and the general knowledge of stereo image processings within videophone environments, would 
have had no difficulty in providing the DCT block classifier and an encoder for providing different 
quantization level processings for foreground and background image data as taught by Katata et 
al for the stereo image videophone system within Stenger for the same well known image 
compressions purposes as claimed. 
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Regarding (c), it is noted that Katata et al does teach the particular finer quantization for 
areas of interest, such as the facial region (i.e., foreground data, see column 1, lines 12-25, 
column 4, lines 45-61, column 5, lines 5-20, column 7, line 26 to column 8, line 23, column 9, line 
36 to column 10, line 13, Figures 1, 13, 14b, 17). It is well recognized in the art that finer 
quantization requires more bandwidth. And though Katata is silent as to where a majority of the 
bandwidth is encoded, it is nevertheless considered obvious that a majority of a bandwidth is 
encoded for the foreground data (i.e., facial region) since a finer quantization level is required. In 
any event, Woodfill et al discloses a data processing system and teaches the conventional 
allocation of a majority of bandwidth for foreground information over background information 
(see column 123, lines 34-49). Therefore, it would have been obvious to one of ordinary skill in 
the art, having the Stenger, Katata et al, and Woodfill et al references in front of him/her and the 
general knowledge of foreground/background encoding of video data, would have had no 
difficulty in using the particular majority of bandwidth allocation for foreground data as taught by 
Woodfill et al to provide a majority of a bandwidth to be encoded at the first quantization level 
and the first high/higher level of quantization for the foreground data of Katata et al and Stenger 
for the same well known image quality control and bandwidth allocation control purposes as 
claimed. 
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5. Claims 1-16 are rejected under 35 U.S.C. 103(a) as being unpatentable over Stenger of 
record (DE 3608489A1) in view of Katata et al of record (5,815,601) and Monro et al 
(6,078,619). 

Stenger discloses a method of improving image segmentation of a video telephone scene 
as shown in Figures 3 and 4, and substantially the same image processing device and system, 
method of encoding a stereo pair of images, computer executable process steps to process image 
data from a stereo pair of images, and apparatus for processing a stereo pair of images as claimed 
in claims 1-16, comprising substantially the same input which receives a stereo pair of images (see 
10 of Figure 3 and 11,12 of Figure 4); a foreground extractor (13-15 of Figure 4 and see page 4, 
lines 4-10 of translated article) coupled to the input which compares location of like pixel 
information in each image to determine which pixel information is foreground pixel information 
and which pixel information is background pixel information, wherein the foreground extractor 
computes the difference in location of like pixels in each image and selects the foreground pixels 
as those pixels whose difference in location falls above a threshold distance; wherein the stereo 
pair of images are received from a stereo pair of cameras spaced closely from one another in a 
video conference system (see Figure 3); the extracting includes identifying the location of like 
pixels in each of the stereo pair of images, calculating the difference between the locations of like 
pixels, and determining for each set of like pixels whether the difference between locations falls 
above a threshold difference, and if so identifying those pixels as foreground information (see 
page 4, lines 4-10 of translated article); a memory which stores process steps (i.e., as provided to 
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carry out functions within Figure 4), and a processor which executes the process steps stored in 
the memory so as to extract foreground information from the stereo pair of images, and if the 
difference in location is above a set threshold the pixel information is identified as foreground 
pixel information, if below the set threshold the pixel information is determined to be background 
pixel information (see page 4, lines 4-10 of translated article); 

Stenger does not particularly disclose, though, the followings: 

(a) a DCT block classifier coupled to the foreground extractor which determines which 
DCT blocks of at least one of the images contain a threshold amount of foreground information; 
and an encoder coupled to the DCT block classifier which encodes the DCT blocks having the 
threshold amount of foreground pixel information with a first level of quantization and which 
encodes the DCT blocks having less than the threshold amount of foreground information as 
background information at a second lower quantization level, the encoder encodes the foreground 
pixel information at a first high level of quantization and which encodes background pixel 
information at a second lower level of quantization, wherein the encoding step encodes the entire 
8x8 block of DCT coefficients at the first higher quantization level if the 8 x 8 block of DCT 
coefficients contains the predetermined amount of foreground pixel information as claimed in 
claims 1,4, 7, 8, 11, 12, and 14-16; 

(b) wherein the foreground pixel information is defined in terms of entire 8x8 blocks of 
DCT coefficients, wherein the encoding step encodes an entire 8x8 block of DCT coefficients as 
foreground information if at least a predetermined number of foreground pixels are within the 8 x 
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8 block, otherwise the entire 8x8 block of DCT coefficients is encoded as background 
information as claimed in claims 6 and 10; and 

(c) wherein at least a majority of a bandwidth is encoded at the first quantization level and 
the first high/higher level of quantization as claimed in claims 1, 4, 7, 8, 1 1, and 14. 

Regarding (a) and (b), Katata et al discloses an image encoder as shown in Figure 1, and 
teaches the conventional use of a DCT block classifier (i.e., within 106 of Figure 1, and see 
column 5, lines 1-4) coupled to a foreground extractor (i.e., 101, 102 of Figure 1 and see column 
4, line 45 to column 5, line 4) for determining which DCT blocks of at least one of the images 
contain a threshold amount of foreground information; an encoder (i.e, within 106 of Figure 1, 
and see column 5, lines 1-4) coupled to the DCT block classifier which encodes the DCT blocks 
having the threshold amount of foreground information with a first level of quantization and 
which encodes the DCT blocks having less than the threshold amount of foreground information 
as background information (i.e., background information is being provided by the threshold 15 of 
Figure 4 of Stenger et al) at a second lower quantization level (see column 1, lines 12-25, columns 
7-8), the encoder encodes the foreground pixel information at a first high level of quantization and 
which encodes background pixel information at a second lower level of quantization (see column 
1, lines 12-25, columns 7-8), wherein the encoding step encodes the entire 8x8 block of DCT 
coefficients at the first higher quantization level if the 8 x 8 block of DCT coefficients contains the 
predetermined amount of foreground pixel information (see column 1, lines 12-58, columns 7-8); 
wherein the foreground pixel information is defined in terms of entire 8x8 blocks of DCT 
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coefficients, wherein the encoding step encodes an entire 8x8 block of DCT coefficients as 
foreground information if at least a predetermined number of foreground pixels are within the 8 x 
8 block, otherwise the entire 8x8 block of DCT coefficients is encoded as background 
information (see column 1, lines 12-58, columns 7-8). Therefore, it would have been obvious to 
one of ordinary skill in the art, having the Stenger and Katata et al references in front of him/her 
and the general knowledge of stereo image processings within videophone environments, would 
have had no difficulty in providing the DCT block classifier and an encoder for providing different 
quantization level processings for foreground and background image data as taught by Katata et 
al for the stereo image videophone system within Stenger for the same well known image 
compressions purposes as claimed. 

Regarding (c), it is noted that Katata et al does teach the particular finer quantization for 
areas of interest, such as the facial region (i.e., foreground data, see column 1, lines 12-25, 
column 4, lines 45-61, column 5, lines 5-20, column 7, line 26 to column 8, line 23, column 9, line 
36 to column 10, line 13, Figures 1, 13, 14b, 17). It is well recognized in the art that finer 
quantization requires more bandwidth. And though Katata is silent as to where a majority of the 
bandwidth is encoded, it is nevertheless considered obvious that a majority of a bandwidth is 
encoded for the foreground data (i.e., facial region) since a finer quantization level is required. In 
any event, Monro et al discloses an object oriented video system and teaches the conventional use 
of a bit rate manager 42 of Figure 1 for allocation of a majority of bandwidth for foreground 
information over background information (see column 2, lines 55-63, column 5, lines 30-37, 
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column 6, lines 7-17). Therefore, it would have been obvious to one of ordinary skill in the art, 
having the Stenger, Katata et al, and Monro et al references in front of him/her and the general 
knowledge of foreground/background encoding of video data, would have had no difficulty in 
using the particular majority of bandwidth allocation for foreground data as taught by Monro et al 
to provide a majority of a bandwidth to be encoded at the first quantization level and the first 
high/higher level of quantization for the foreground data of Katata et al and Stenger for the same 
well known image quality control and bandwidth allocation control purposes as claimed. 
6. Any response to this action should be mailed to: 

Commissioner of Patents and Trademarks 
Washington, D.C. 20231 

or faxed to: 

(703) 872-9314, (for formal communications intended for entry) 
(for informal or draft communications, please label "PROPOSED" or "DRAFT") 

Hand-delivered responses should be brought to Crystal Park II, 2 12 1 Crystal Drive, 
Arlington. VA., Sixth Floor (Receptionist). 
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7. Any inquiry concerning this communication or earlier communications from the examiner 
should be directed to Richard Lee whose telephone number is (703) 308-6612. The Examiner can 
normally be reached on Monday to Friday from 8:00 a.m. to 5:30 p.m, with alternate Fridays off. 

Any inquiry of a general nature or relating to the status of this application should be 
directed to the Group customer service whose telephone number is (703) 306-0377. 




Richard Lee/rl 



